Algorithmic embeddings

نویسنده

  • Mihai Badoiu
چکیده

We present several computationally efficient algorithms, and complexity results on low distortion mappings between metric spaces. An embedding between two metric spaces is a mapping between the two metric spcaes and the distortion of the embedding is the factor by which the distances change. We have pioneered theoretical work on relative (or approximation) version of this problem. In this setting, the question is the following: for the class of metrics C, and a host metric M ′, what is the smallest approximation factor a ≥ 1 of an efficient algorithm minimizing the distortion of an embedding of a given input metric M ∈ C into M ′? This formulation enables the algorithm to adapt to a given input metric. In particular, if the host metric is “expressive enough” to accurately model the input distances, the minimum achievable distortion is low, and the algorithm will produce an embedding with low distortion as well. This problem has been a subject of extensive applied research during the last few decades. However, almost all known algorithms for this problem are heuristic. As such, they can get stuck in local minima, and do not provide any global guarantees on solution quality. We investingate several variants of the above problem, varying different host and target metrics, and definitions of distortion. We present results for different types of distortion: multiplicative versus additive, worst-case versus average-case and several types of target metrics, such as the line, the plane, d-dimensional Euclidean space, ultrametrics, and trees. We also present algorithms for ordinal embeddings and embedding with extra information. Thesis Supervisor: Piotr Indyk Title: Associate Professor

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Stability of intersections of graphs in the plane and the van Kampen obstruction

A map φ : K → R2 of a graph K is approximable by embeddings, if for each ε > 0 there is an ε-close to φ embedding f : K → R2. Analogous notions were studied in computer science under the names of cluster planarity and weak simplicity. In this survey we present criteria for approximability by embeddings (P. Minc, 1997, M. Skopenkov, 2003) and their algorithmic corollaries. We introduce the van K...

متن کامل

Design of a parallel interconnect based on communication pattern considerations

An interconnect's versatility appears through its ability to support a variety of algorithmic patterns based on the physical or logical embeddings within its topology that match the desired algorithmic patterns. However, most such embeddings are available on a discrete basis, though a particular algorithm may require a variety of embeddings during diierent phases of its operation. To provide fo...

متن کامل

Algorithmic and asymptotic properties of groups

This is a survey of the recent work in algorithmic and asymptotic properties of groups. I discuss Dehn functions of groups, complexity of the word problem, Higman embeddings, and constructions of finitely presented groups with extreme properties (monsters). Mathematics Subject Classification (2000). Primary 20F65; Secondary 20F10.

متن کامل

Learning Low-Dimensional Representations of Medical Concepts

We show how to learn low-dimensional representations (embeddings) of a wide range of concepts in medicine, including diseases (e.g., ICD9 codes), medications, procedures, and laboratory tests. We expect that these embeddings will be useful across medical informatics for tasks such as cohort selection and patient summarization. These embeddings are learned using a technique called neural languag...

متن کامل

Multiclass Learning by Probabilistic Embeddings

We describe a new algorithmic framework for learning multiclass categorization problems. In this framework a multiclass predictor is composed of a pair of embeddings that map both instances and labels into a common space. In this space each instance is assigned the label it is nearest to. We outline and analyze an algorithm, termed Bunching, for learning the pair of embeddings from labeled data...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006